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Abstract Text for Online or Printed Programs: 

AIRS/ AM SU is the state of the art atmospheric sounding system flying aboard EOS Aqua. These observations, covering the period 
September 2002 until the present, have been analyzed using the AIRS Science Team Version-5 retrieval algorithm. Many 
researchers have used these products to make significant advances in both climate and weather applications. The AIRS Science 
Team Version-6 Retrieval algorithm contains many significant theoretical and practical improvements compared to Version-5 which 
should further enhance the utility of AIRS products for both climate and weather applications. Advances found in the AIRS 
Version-6 retrieval algorithm, as well as early Version-6 results, will be presented. 

Abstract Text for Technical Review Purposes: 

AIRS/AMSU is the state of the art infra-red and microwave atmospheric sounding system flying aboard EOS Aqua. The Goddard 
DISC has analyzed AIRS/AMSU observations, covering the period September 2002 until the present, using the AIRS Science Team 
Version-5 retrieval algorithm. These products have been used by many researchers to make significant advances in both climate 
and weather applications. The AIRS Science Team Version-6 Retrieval, which will become operation in mid-2012, contains many 
significant theoretical and practical improvements compared to Version-5 which should further enhance the utility of AIRS products 
for both climate and weather applications. In particular, major changes have been made with regard to the algorithms used to 1) 
derive surface skin temperature and surface spectral emissivity; 2) generate the initial state used to start the retrieval procedure; 
3) compute Outgoing Longwave Radiation; and 4) determine Quality Control. This paper will describe these advances found in the 
AIRS Version-6 retrieval algorithm and demonstrate the improvement of AIRS Version-6 products compared to those obtained 
using Version-5. 
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ABSTRACT 

The Goddard DISC generated products derived from AIRS/AMSU-A observations, starting from September 2002 when 
the AIRS instrument became stable, using the AIRS Science Team Version-5 retrieval algorithm. The AIRS Science 
Team Version-6 retrieval algorithm became operational at the Goddard DISC in late 2012. This paper describes some of 
the significant improvements in retrieval methodology contained in the Version-6 retrieval algorithm, compared to that 
used in Version-5. In particular, the Science Team made major changes with regard to the algorithms used to 1) derive 
surface skin temperature and surface spectral emissivity; 2) generate the initial state used to start the cloud clearing and 
retrieval procedures; and 3) determine Quality Control. This paper describes these advances found in the AIRS Version-6 
retrieval algorithm and demonstrates the improvements of some AIRS Version-6 products compared to those obtained 
using Version-5. 

Keywords: AIRS/AMSU, high spectral resolution IR sounders, retrieval methodology, IR sounding in cloudy 
conditions, cloud cleared radiances, Quality Control. 

1. INTRODUCTION 

AIRS was launched on EOS Aqua in May 2002, together with AMSU-A and HSB (which subsequently failed early in 
the mission), to form a next generation polar orbiting infrared and microwave atmospheric sounding system 1 . 
AIRS/AMSU had two primary objectives. The first objective was to provide real-time data products available for use by 
the operational Numerical Weather Prediction Centers in a data assimilation mode to improve the skill of their 
subsequent forecasts. The second objective was to provide accurate unbiased sounding products with good spatial 
coverage that are used to generate stable multi-year climate data sets to study the earth’s interannual variability, climate 
processes, and possibly long-term trends. 

AIRS is a grating spectrometer with a number of linear arrays of detectors with each detector sensitive to outgoing 
radiation in a characteristic frequency v t with a spectral band pass Av* of roughly Vj/1200. AIRS contains 2378 spectral 
channels covering portions of the spectral region 650 cm" 1 (15.38 pm) - 2665 cm' 1 (3.752 pm). The spectral sampling 
interval (except for the existence of a few gaps) is Vj/2400, giving two AIRS channels per spectral half width. AIRS is 
accompanied by the temperature sounding 60 GHz microwave instrument AMSU-A. There is a 3x3 array of AIRS 
footprints within a given AMSU-A footprint, with spatial resolutions of 13 km and 45 km at nadir viewing for AIRS and 
AMSU respectively. Each AIRS footprint is referred to as a Field of View (FOV), and the AMSU-A footprint is referred 
to as a Field of Regard (FOR). 

The Goddard DISC generated products derived from AIRS/AMSU-A observations, starting from September 2002 when 
the AIRS instrument became stable, using the AIRS Science Team Version-5 retrieval algorithm. 2 The AIRS Science 
Team Version-6 retrieval algorithm contains many theoretical and practical improvements over that used in Version-5. 
The Goddard DISC started processing new AIRS/AMSU data, as well as the reprocessing of all old AIRS/AMSU data, 
using the AIRS Science Team Version-6 retrieval algorithm in late 2012. The Version-6 retrieval system runs in both in 
an AIRS/AMSU mode as well as in an “AIRS Only” mode, designed as a back-up system to be used in the event that the 
AMSU-A instrument continues to degrade significantly beyond its present functionality. This paper describes some of 
the significant improvements contained in the Version-6 retrieval algorithm, compared to that used in Version-5, with an 
emphasis on the improvement of ocean surface skin temperatures and atmospheric temperature and water vapor profiles. 



The methodology used to analyze AIRS/AMSU observations is essentially unchanged from that described in Susskind et 
al. (2003) 3 . Fundamental to this approach is the generation of clear column radiances for each AIRS channel i, R h which 
are derived products representing the radiance channel i would have seen if the entire 3x3 AIRS FOR were cloud free. R t 
is determined for each channel as a linear combination of the observed radiances of that channel in each of the 9 FOV’s, 
using coefficients that are channel independent. The retrieved geophysical state X is subsequently determined which, 
when substituted in a Radiative Transfer Algorithm (RTA), generates an ensemble of computed radiances R t {X ) which 
are consistent with R t for those channels i used in the determination of X. Cloud-clearing theory 4,5 says that to achieve 
the best results in more stressing cloud conditions, longwave channels sensitive to cloud contamination should be used 
only in the determination of the coefficients used in the generation of clear column radiances for all channels and not for 
sounding purposes. In Version-5 2 , tropospheric sounding 15 pm C0 2 observations were used only in the derivation of the 
cloud clearing coefficients, and temperature profiles were derived using R t in the 4.3 pm C0 2 band as well as in some 
stratospheric 15 pm C0 2 channels that do not see clouds. This new approach allowed for the retrieval of accurate Quality 
Controlled values of R t and T(p) under more stressing cloud conditions than was achievable in Version-4. Version-5 also 
contained a new methodology to provide accurate case-by-case error estimates for retrieved geophysical parameters and 
for channel-by-channel clear column radiances. Thresholds of these error estimates were used in a new approach for 
Quality Control (QC) in Version-5. 

The AIRS Version-6 retrieval algorithm has further significant advances over Version-5. The basic theoretical approach 
used in Version-6 to analyze AIRS/AMSU data is very similar to Version-5 with one major exception. As in Version-5, 
the coefficients used for generation of cloud cleared radiances R t for all channels are determined using observed 
radiances only in longwave 15 pm and 11 pm channels. In Version-5, tropospheric temperatures were retrieved using 
only R t in the AIRS shortwave 4.2 pm C0 2 channels, but surface skin temperature was retrieved simultaneously, along 
with surface spectral emissivity and bi-directional reflectance, using observations both in the longwave 8-12 pm window 
region and in the shortwave 4.0 pm - 3.76 pm window region. In Version-6, only window observations in the shortwave 
window region, 4.0 pm - 3.76 pm, are used to simultaneously determine surface skin temperatures along with shortwave 
surface spectral emissivities and surface bi-directional reflectance. Longwave surface spectral emissivity is retrieved in a 
subsequent step. Another significant improvement found in Version-6 is the use of an initial guess X° generated by 
using Neural-Net methodology 6,7 in place of the previously used regression approach. These modifications have resulted 
in significant improvement in the ability to obtain both accurate temperature profiles and surface skin temperatures under 
more stressing partial cloud cover conditions. The Version-6 retrieval algorithm also has significant further 
improvements over Version-5 in its Quality Control methodology, as well as other improvements that are not discussed 
in this paper. 


2. STEPS IN THE VERSION-6 RETRIEVAL ALGORITHM 

Retrievals of all geophysical parameters are physically based and represent states Xj c determined for case c that best 
match a set of clear column radiances R i c for the subset of AIRS channels i used in the retrieval process. Retrievals of 
geophysical parameters are performed sequentially, that is, only a subset of the geophysical parameters within the state 
Xj is modified from that of the incoming state X® in a given step. A GCM forecast is not used in any way in the retrieval 
procedure, except for use of the forecasted surface pressure p surf as the lower pressure boundary when computing 
expected radiances R t (X ) for a given geophysical state Xj. 

In Version-5 2 , the steps in the physical retrieval process were as follows: A regression-based start-up procedure was used 
to generate the initial state X°. Initial clear column radiances R ° were generated for all channels k using the initial state 
X° and cloud-clearing coefficients generated using observed radiances in an ensemble of cloud clearing channels 2 . The 
state A 0 is also used as the initial guess to the physical retrieval process in which AIRS/AMSU observations used to 
retrieve: a) surface skin temperature, surface spectral emissivity and surface bi-directional reflectance of solar radiation; 
b) atmospheric temperature profile; c) atmospheric moisture profile; d) atmospheric ozone profile; e) atmospheric CO 
profile; f) atmospheric CH 4 profile; and g) cloud properties and OLR. These steps were done sequentially, solving only 
for the variables to be determined in each retrieval step while using previously determined variables as fixed with an 
appropriate uncertainty attached to them which was accounted for in the channel noise covariance matrix used in that 
step. The objective in each step (a-f) was to find solutions which best match R t for the subset of channels selected for use 



in that step, bearing in mind the channel noise covariance matrix. Steps a-f were ordered so as to allow for selection of 
channels in each step which are primarily sensitive to variables to be determined in that step or determined in a previous 
step, and are relatively insensitive to other parameters. Separation of the problem in this manner allowed for the problem 
in each step to be made as linear as possible. Step g is performed using the observed radiances R t after the surface and 
atmospheric conditions have been determined. 

In Version-6, there is a slight modification to the sequence of steps used in Version-5, as there is a new step performed in 
the retrieval sequence. In Version-5, step a) used channels in both the longwave and shortwave window regions and 
simultaneously solved for surface skin temperature T s , shortwave surface spectral emissivity £ sw (v) and surface spectral 
bi-directional reflectance p sw ( v )? an d longwave surface spectral emissivity £^ w (v). In Version-6, only shortwave 
window channels are used in this retrieval step to simultaneously determine T s , £ sw (v), and p 5W (v). The longwave 
surface spectral emissivity £^ w (v) is now solved for in a subsequent step using only channels in the longwave window 
spectral region. This new step is performed after the humidity profile retrieval step because longwave window radiances 
can be very sensitive to the amount of atmospheric water vapor. The steps used in the Version-6 AO (AIRS Only) 
algorithm are otherwise identical, but no AMSU-A observations are used in the physical retrieval process or in the 
generation of the initial state A 0 . In addition, as stated previously, X° is determined in Version-6 via a Neural Net 
methodology rather than by regression. 

3. CHANNELS AND FUNCTIONS USED IN DIFFERENT STEPS OF VERSION-6 

Figure 1 shows a typical AIRS cloud free brightness temperature spectrum and includes the channels used in Version-6 
and Version-6 AO for cloud clearing, as well as in each of the different steps of the AIRS physical retrieval algorithm. 
These channels are described below. 

3.1 Cloud clearing and temperature profile retrieval 

Following cloud-clearing theory, coefficients needed to generate clear column radiances for all channels are determined 
using observations in 57 longwave channels, ranging from 701 cm" 1 to 1228 cm" 1 , which we show in yellow in Figure 1. 
These same channels are also the ones used in the cloud parameter retrieval step. The temperature profile retrieval step 
uses 37 channels between 2358 cm' 1 and 2395 cm" 1 that are sensitive to both stratospheric and tropospheric temperatures, 
as well as 53 stratospheric sounding channels between 662 cm" 1 and 713 cm" 1 that are not sensitive to cloud 
contamination. We show these channels in red in Figure 1. Version-6 also includes 24 additional channels in the 
temperature profile retrieval step between 2396 cm" 1 and 2418 cm" 1 , also shown in red, that are used in both the 
temperature profile and the surface parameter retrieval steps. Version-6 includes use of AMSU-A channels 3, 6 and 8-14 
in the temperature profile retrieval step as well, while Version-6 AO does not. Version-5 also included AMSU-A 
channels 4 and 5 in the temperature profile retrieval step, but those channels became noisy in 2007 and are no longer 
used. AMSU-A channel 7 was noisy at launch and was never used in any step of the retrieval process. 

3.2 Surface skin temperature and longwave spectral emissivity retrievals 

The surface skin temperature retrieval step uses 36 channels between 2420 cm" 1 and 2664 cm" 1 , which we show in light 
blue in Figure 1, along with the 24 highest frequency channels which are also used in the temperature profile retrieval 
step. These 60 channels are used to determine the surface skin temperature simultaneously with four independent pieces 
of information about surface shortwave spectral emissivity and four additional independent pieces of information of 
shortwave surface bi-directional reflectance during the day. Therefore the skin temperature retrieval steps solves for nine 
unknowns during the day and five unknowns at night. Surface longwave spectral emissivity is determined using 77 
channels, between 758 cm" 1 and 1250 cm" 1 , which we show in purple in Figure 1. In this step, three longwave emissivity 
perturbation functions are solved for, with T s being held fixed at the value determined from the previous skin temperature 
retrieval step. The initial guess for surface spectral emissivity in both retrieval steps, £°, is set equal to the values found 
in the AIRS Science Team ocean emissivity model over non- frozen ocean. Over land and frozen ocean, £° is set equal to 
values coming from the 1° x 1° monthly mean MODIS Science Team surface spectral emissivity data set. 
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3.3 Constituent profile retrievals 


As in Version-5, constituent profile retrievals are performed in separate subsequent steps, each having their own set of 
channels and functions. Figure 1 shows, in different colors, the Version-6 channels used in each of these retrieval steps. 
The q(p) retrieval (pink stars) uses 41 channels in the spectral ranges 1310 cm" 1 to 1605 cm' 1 and 2608 cm" 1 to 2656 cm" 1 ; 
the 0 3 (p) retrieval (green stars) uses 41 channels between 997 cm" 1 and 1069 cm" 1 ; the CO(p) retrieval (gray stars) 
uses 36 channels between 2181 cm" 1 and 2221 cm" 1 ; and the CH 4 (p) retrieval (brown stars) uses 58 channels between 
1220 cm" 1 and 1356 cm" 1 . 


4. COMPARISON OF QUALITY CONTROLLED VERSION-6 AND VERSION-6 AO 
RETRIEVALS WITH THOSE OF VERSION-5 

Our evaluation compares V6 and V6 AIRS Only (V6 AO) Quality Controlled products with those of Version-5. In the 
subsequent sections, we evaluated ocean surface skin temperature T s and surface emissivity £ v , temperature profile T(p), 
and water vapor profile q(p). Our evaluations compare results obtained on seven focus days to collocated ECMWF truth. 
The seven focus days are: September 6, 2002; January 25, 2003; September 29, 2004; August 5, 2005; February 24, 
2007; August 10, 2007; and May 30, 2010. All products have quality flags based on thresholds of error estimates. Both 
Version-5 and Version-6 use quality flags for the level-2 output products with the general scheme that QC=0 indicates 
the best quality products designated for use in a data assimilation application. Products flagged with QC=1 are of good 
quality designated to be included along with those with QC=0 in the generation of gridded Level-3 products used for 
climate research. Products flagged with QC=2 should not be used for any purpose. 

4.1 Surface skin temperature T s and surface spectral emissivity s v 

Figure 2 shows counts of all Quality Controlled Ocean Surface Skin Temperatures over the latitude range, - 50°S, 
as a function of the difference between T s and ECMWF “truth” for the 7-day evaluation period. We show the counts of 
Version-5 retrievals in red and pink, Version-6 retrievals in dark blue and light blue, and Version-6 AO retrievals in 
black and gray. The lighter shade of each color shows counts of best quality T s retrievals with tight error estimate 


thresholds (QC=0). The darker shade shows counts of both best and good quality T s retrievals including cases with QC=0 
or 1, where the error estimate thresholds for QC=lare looser than those for QC=0. 
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Ocean T s retrievals with QC=0 or 1 are the ensemble used to generate the Level-3 Oceanic SST product used for climate 
studies. Figure 2 also contains statistics for each set of retrievals showing the mean difference from ECMWF, the 
standard deviation of the ensemble differences, the percentage of all possible cases included in the Quality Controlled 
ensemble, and percentage of all accepted cases with absolute differences from ECMWF of more than 3K from the mean 
difference. Such cases are referred to as outliers. 


Version-6 QC’d retrievals accept considerably more cases than Version-5 and have much lower standard deviations of 
the errors as well. In both ensembles, the percentage of outliers grows with loosening the QC thresholds, as expected. 
Version-6 outliers with QC=0,1 are somewhat larger than Version-5, but the yield is more than twice as large. It is 
noteworthy that Version-6 retrievals with QC=0 have a much smaller percentage of outliers than do Version-5 retrievals 
with QC=0,1 along with a substantially higher yield. One point of slight concern in this figure is that the cold mean bias 
in Version-6 retrievals compared to ECMWF is somewhat larger than that of Version-5. Statistics of QC’d Version-6 AO 
retrievals are very similar to those of Version-6. 

Figure 3 shows the spatial distribution of the seven day mean differences of the Level-3 oceanic SST products from 
collocated ECMWF values for both Version-6 and Version-5. The values shown in a given grid box are the average 
values for that grid box of all cases in which the SST retrieval was accepted either at 1:30 AM or 1:30 PM. The oceanic 
cases shown in gray indicate where we obtained not a single value of QC’d SST for all 14 possible cases (seven days, 
twice daily). Figure 3 represents the spatial coverage and accuracy of “pseudo seven day mean” Level-3 products. The 
seven non-consecutive days included in the figure do not represent a typical seven day Level-3 product, but provide 
valuable information nonetheless. 


Each panel in Figure 3 indicates the mean difference of the Level-3 SST field from its own collocated oceanic ECMWF 
values, the spatial standard deviation over all grid points of the Level-3 differences from ECMWF, and the percentage of 
all possible oceanic grid points that have at least one accepted value over the seven day period (i.e., one not gray). The 
Version-6 Level-3 SST product has much better spatial coverage, with 99.35% of all oceanic grid points 6M -60° S 


being filled, compared to Version-5 with only 92.24%. Moreover, there are large coherent spatial areas in which no 
Version-5 retrievals were accepted on any of the seven days. The spatial standard deviation of the Version-6 Level-3 
SST product errors compared to ECMWF truth is also much smaller than that of Version-5, and the Version-6 area mean 
negative bias is also smaller than that of Version-5, counter to what might have been expected based on Figure 2. 
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Fig. 3 

Figures 4a and 4b show the mean difference of the retrieved ocean surface emissivity e v from that of the Masuda model 
as a function of satellite zenith angle for v = 950 cm" 1 and v = 2400 cm' 1 . The channels are in the longwave and 
shortwave window regions respectively. In these figures, we show statistics separately for AM orbits in dark colors, and 
PM orbits in light colors. In both the longwave and shortwave window regions, Version-6 (as well as Version-6 AO) 
retrieved ocean spectral emissivities as a function of satellite zenith angle are very close to the values expected using the 
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Masuda emissivity model. Differences of Version-5 retrieved £ v from Masuda are much larger than those of Version-6. 
Version-5 values of £ v also show a large spurious feature during the day in the vicinity of a satellite zenith angle of 
-18.24 degrees, which is the viewing angle in which maximum sunglint appears in the field of view. There is no 
appreciable difference between Version-6 and Version-6 AO in results related to ocean values of £ v . 

4.2 T(p) and q(p) retrieval accuracy as a function of yield 

The fundamentals of the methodology used in Version-6 to retrieve temperature profile T(p) and q(p) from AIRS cloud 
cleared radiances R l are basically the same as those used in Version-5. Quality Controlled (QC’d) Version-6 retrievals of 
T(p) and q(p) are significantly better than those of Version-5 for three reasons: 1) Version-6 uses Neural Net generated 
first guesses for T(p) and q(p) in place of the regression generated first guess used in Version-5. The Neural Net first 
guesses are more accurate than the regression guesses, especially under more cloudy conditions. This allows for the 
generation of accurate QC’d Version-6 retrievals under cloudier cases than was achievable in Version-5; 2) Version-6 
has improved QC procedures for T(p) and q(p) than those that were used in Version-5; and 3) Improved Version-6 
surface parameters also allow for improved Version-6 T(p) and q(p) in the boundary layer, especially over land. 

Figure 5 shows statistics of the differences of QC’d Version-5 and Version-6 retrievals from collocated ECMWF truth 
for a global ensemble of cases taken over the seven focus days. Panel (a) shows the percentage of QC’d cases accepted as 
a function of height, panel (b) shows RMS differences of 1 km layer mean temperatures from collocated ECMWF truth, 
and panel (c) shows biases of QC’d 1 km layer mean differences from ECMWF. Statistics are shown for six sets of 
results. We show the results for Version-5 retrievals in red, results for Version-6 retrievals in blue, and results for 
Version-6 AIRS Only retrievals in black. Version-5 did not have QC’d AIRS Only retrievals. The two sets of curves 
shown for each experiment represent results using different T(p) QC error estimate thresholds. Version-5 had only one 
set of QC thresholds, called Standard thresholds. These Version-5 error estimates thresholds were chosen so as to 
provide a middle ground between the highest accuracy, which would be optimal for Data Assimilation (DA) purposes, 
and the highest yield (best spatial coverage), which would be optimal for Climate purposes. Experience using Version-5 
products showed that Standard QC thresholds were optimal for neither purpose. For example, Data Assimilation 
experiments using Version-5 retrievals that passed a tighter set of QC thresholds than found in the official Version-5 
system resulted in significantly improved forecasts compared to those passing the looser Standard QC thresholds. 8 The 
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solid red lines in Figure 6, and subsequent figures, show statistics of Version-5 retrievals passing the tighter QC 
thresholds, which we refer to as Tight QC thresholds, and the dashed red lines show equivalent statistics for the ensemble 
of Version-5 retrievals passing Standard thresholds. Version-6 uses two different sets of thresholds, a very tight set of 
thresholds newly optimized for Data Assimilation purposes (QC=0), and a substantially looser set of thresholds optimal 
for Climate purposes (QC=1). As with Version-5, the solid lines show Version-6 and Version-6 AO results using the 
Data Assimilation (DA) QC thresholds, and the dashed lines show results using the Climate thresholds. Level-3 gridded 
products utilize all cases passing Climate QC. 

In Version-5, all retrievals were either accepted or rejected above 70 mb based on use of different types of tests, even 
before applying the QC procedures. One of the tests that disqualified the entire temperature profile, and flagged the entire 
profile with QC=2 (do not use), is that the retrieved cloud fraction is over 90%. Roughly 83% of Version-5 retrievals 
passed the initial screening procedure, but none of them occurred in near overcast conditions. Version-5 retrievals with 
Tight QC have considerably lower yield than those with Standard QC below 200 mb, with correspondingly smaller RMS 
errors, on the order of IK beneath 300 mb. There is no appreciable difference in Version-5 bias errors compared to 
ECMWF found using either set of QC thresholds. 


Version-6 does not apply any test which eliminates the entire temperature profile, other than the requirement that the 
retrieval runs to completion. Version-6 retrievals using DA thresholds have roughly IK RMS errors throughout the 
atmosphere, with a yield much higher than Version-5 Tight down to about 500 mb. Among other benefits from the Data 
Assimilation perspective, this capability will allow for the assimilation of AIRS temperature products above the clouds in 
storms, as well as in overcast conditions. The yield of Version-6 retrievals with DA QC is lower than that of Version-5 
Tight beneath 500 mb, but with a considerable improvement in mid-lower tropospheric temperature RMS errors, with 
errors less than IK, which has been found to be optimal for Data Assimilation purposes. The yield of Version-6 retrievals 
with Climate QC is extremely high throughout the atmosphere, with a value of about 83% at the surface. Achievement of 
this very high yield is extremely valuable in the generation of more representative Level-3 Climate data set. RMS errors 
of Version-6 retrievals with Climate QC are better than, or comparable to, those of Version-5 with Standard QC down to 
about 700 mb, but with a much higher yield. Beneath 700 mb, Version-6 Climate QC RMS errors are somewhat larger 
than those of Version-5 with Standard QC, but the Version-6 results are essentially unbiased, which is the more 
important statistic with regard to the generation of the Level-3 products used for Climate research. QC’d results for 
Version-6 AO are roughly comparable to those of Version-6 but with a somewhat lower yield near the surface. 
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Figure 6 shows analogous results comparing QC’d 1 km layer precipitable water to that of collocated values of ECMWF. 
Figure 6 contains results for only Version-5 retrievals and Version-6 retrievals. We show results only up to 200 mb, 
above which water vapor retrievals are considered of minimal validity, and are not included in the AIRS Science Team 
Standard Product data set. The relative results regarding Version-5 and Version-6 are analogous to those found for T(p). 
Version-6 q(p) retrievals with DA QC are considerably improved over those of Version-5 in the lower troposphere. This 
improvement is at least partially a result of the improved values of T s , and s v in Version-6 compared to Version-5. As 
with T(p ), Version-6 q(p) retrievals with Climate QC are unbiased, have high accuracy, and contain almost complete 
spatial coverage. 

Figures 5 and 6 provide very important information about the accuracy of Quality Controlled retrievals obtained by 
different retrieval systems each using their own QC procedures. Indeed, the ability of a different retrieval system to 
perform QC is a critical part of that retrieval system, especially in the generation of Level-3 products. Figures 5 and 6 do 
not tell the whole story about the relative accuracy of the retrievals obtained in Version-5 and Version-6, because results 
are shown for different ensembles of cases. Figure 7a compares RMS T(p) errors of Version-6 and Version-5 retrievals 
when evaluated on common ensembles of cases. Results for two such ensembles are shown: an ensemble of relatively 
easier (less cloudy) cases given by those cases accepted in Version-5 using Tight QC (shown in solid lines); and an 
ensemble including much more difficult (more cloudy) cases given by those cases accepted in Version-6 using Climate 
QC, shown in dashed lines. As previously, we show Version-6 RMS errors in blue and Version-5 RMS errors in red. 
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Final Retrievals Version-5 Final and First Guess Version-6 Final and First Guess 
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Version-6 retrievals for the easier (solid line) cases are more accurate than those of Version-5 at all levels, but the degree 
of improvement below 500 mb is relatively small for these easier cases. The accuracy of Version-5 retrievals degrades 
much more rapidly than those of Version-6 for the harder cases (dashed lines). It is for this reason that the Version-5 
retrieval system did not use relaxed QC thresholds that would have provided much for higher yields in the generation of 
Level-3 products from Version-5. Figures 6b and 6c include the same retrieval results shown in Figure 6a, separated into 
results for Version-5 and Version-6 respectively. Each figure also shows the accuracy of the first guesses used in the 
appropriate version, and demonstrates the extent that retrievals in a given version improve on their respective first 
guesses. Both figures show that the physical retrieval improves over the first guess beneath 100 mb on each ensemble of 
cases. Both sets of first guesses are of comparable quality in the mid-lower troposphere when evaluated on the easier 


ensemble of cases, as are both sets of retrievals. The regression-based first guess degrades much more significantly on 
the more difficult set of cases than does the Neural-Net first guess. This allows for the ability of Version-6 to perform so 
well under almost all cloud conditions. This provides an extremely importaint capability in the generation of highly 
representative Version-6 Level-3 products used for climate studies. 

All results shown so far have been for a global ensemble of cases. Table 1 contains a breakdown of two temperature 
profile statistics, the Tropospheric Temperature Metric (TTM) and the Boundary Layer Metric (BLM), evaluated over 
different spatial regions: global; land 50°N to 50°S; ocean 50°N to 50°S; poleward of 50°N; and poleward of 50°S. In this 
table, TTM represents the mean RMS T(p) error over all 1km layers from the surface to 100 mb, and BLM represents the 
mean RMS T(p) error over the six lowest 0.25 km layers from the surface. TTM and BLM results for Version-5 and 
Version-6 retrievals evaluated over the Version-5 Tight ensemble are shown in Table la, and evaluated over the Version- 
6 Climate QC ensemble are given in Table lb. 


Table 1. 7-Day Mean Statistics Tropospheric Temperature Metric (TTM) 
and Boundary LayerMetric (BLM) 


Cases in Common Using the Version-5 Tight Ensemble 


Poleward Poleward 



Global 

Land ±50 

Ocean 

\UX 

lo 

of 50°N 

of 50°S 


TTM 

BLM 

TTM 

BLM 

TTM 

BLM 

TTM 

BLM 

TTM 

BLM 

Version-5 

1.10 

1.29 

1.19 

1.71 

1.04 

1.13 

1.14 

1.50 

1.31 

1.76 

Version-6 0.92 1.16 0.94 1.49 0.86 

Cases in Common Using the Version-6 Climate Ensemble 

0.98 

0.96 1.47 

Poleward 

1.20 1.69 

Poleward 


Global 

Land ±50 

Ocean 

\UX 

lo 

of 50°N 

of 50°S 


TTM 

BLM 

TTM 

BLM 

TTM 

BLM 

TTM 

BLM 

TTM 

BLM 

Version-5 

1.67 

2.57 

1.82 

2.78 

1.65 

2.48 

1.53 

2.39 

1.72 

2.72 

Version-6 

1.11 

1.67 

1.06 

1.75 

1.03 

1.34 

1.12 

1.93 

1.32 

2.02 


Version-6 T(p) retrievals are superior to those of Version-5 with regard to both metrics in all spatial regions. It is 
important to note the improvement of Version-6 Boundary Layer Temperatures compared to Version-5 especially over 
land, even for the easier ensemble of cases. This improvement over land is at least in part a result of the improved values 
of surface skin temperature and surface spectral emissivity retrieved over land. A stated goal for Version-6 included 
improvement of the boundary layer temperature, which has indeed been accomplished. The improvement of boundary 
layer temperatures is even more pronounced when evaluated over the ensemble of more difficult cases. 

Figure 8 shows the spatial distribution of a pseudo-Level-3 seven day field of accepted cases of total precipitable water, 
IV tot, flagged to be of climate quality (QC=0,1). In Version-6, W TO t is flagged to be of climate quality if the water vapor 
profile passes the climate QC test down to the surface. Version-5 used a different procedure. The values shown for 
Version-6 and Version-5 represent the ensemble mean difference, for all accepted cases within that grid box, of the 
retrieved value of W T0T from the collocated ECMWF value of W T0T . We show, in gray, those grid points where we found 
no accepted values of W T0T for any of the seven days, either daytime or nighttime. Statistically, Version-6 seven day 
mean values of W T0T are considerably more accurate than those of Version-5, both with regard to global mean bias and as 
well as to the standard deviation of the errors. Even more important from the climate perspective, spatial coverage of the 
seven day mean Version-6 product, with 99.89% of the grid boxes filled, is much more complete than that of Version-5, 
with 96.12% of grid boxes filled. Moreover, the 3.88% of grid boxes for which no accepted soundings were generated in 
Version-5 tend to come in spatially coherent groups in oceanic areas where low clouds exist. 



PRESSURE (mb) 


7-Day Total Precipitable Water (cm) 

Retrieved minus ECMWF AM/PM Average 
Version-6 Version-5 



Fig. 8 


4.3 Yield and spurious bias trends of T(p) and q(p) 

Version-5 retrievals have two very undesirable characteristics with regard to trends, both of which were considered 
critical to improve upon in Version-6. The first was that the percent yield of accepted retrievals was found to be 
decreasing over time (negative yield trend). The second was that the mean differences of QC’d retrieved temperatures 
from collocated truth values was found to be changing over time as well, especially beneath 300 mb (spurious bias 
trend). In this section, we examine yield and spurious bias trends of Version-5 and Version-6 products. We computed 
these trends by taking the slope of the linear least squares fit passing through the values of the appropriate parameter for 
each of the seven ensemble days as a function of time. Figure 9 shows these trends as a function of pressure for Version- 
5, Version-6, and Version-6 AO. All results show cases with Climate QC (Standard QC for Version-5) because trends 
are most significant with regard to the generation of Level-3 climate research products. It is apparent that Version-6 


Global Trends 

Percent of All Cases Layer Mean Temperature Bias Layer Precipitable Water Bias 

Accepted (%/yr) Differences from ECMWF (K/yr) % Differences from ECMWF (%/yr) 



Vers! on-5 Climate QC 
Vers! on-6 Climate QC 
Verslon-6 AIRS Only Climate QC 
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eliminated the substantial negative tropospheric temperature profile yield trends, on the order of 1% per year, which were 
found in Version-5. In addition, the Version-6 negative T(p) bias trends beneath 500 mb are much smaller than those of 
Version-5, which were on the order of -0.05K/yr. The negative q(p) bias trends found in Version-5 are also substantially 
reduced in Version-6. It is interesting to note that the spurious q(p) bias trends found in Version-5, as a function of 
pressure, tend to follow those of T(p) in sign. This is consistent with physics in that a spuriously cold temperature 
solution for a given case lowers the computed radiance for the water vapor sounding channels in that case. Therefore, if 
the retrieved temperature for a given case is too low, the solution for q(p) will result in a lowered retrieved water amount 
as well, so as to raise the radiances computed in the water vapor sounding channels using the incorrect value of T(p) in 
order to match the cloud cleared radiances for that case. In an analogous fashion, a negative spurious tend of T(p) will 
result in a spurious negative trend of q(p) as well. Trend results for Version-6 and Version-6 AO are similar to each 
other. 


5. SUMMARY 

Our validation studies of Version-6 products compared to those of Version-5 indicate that retrieved Version-6 surface 
skin temperatures and emissivities, as well as temperature and moisture profiles, are superior in every way to those of 
Version-5. This should enhance their utility both with regard to the study of Climate processes, and also for possible 
improvement of numerical weather prediction forecast skill when used in a Data Assimilation mode. Accuracy of 
Version-6 AIRS Only products is only slightly degraded from that of Version-6. This shows that the AIRS Only retrieval 
system would be adequate to use for Data Assimilation and Climate purposes should AMSU-A fail. This also shows that 
while it is desirable for an advanced microwave sounder to accompany and advanced IR sounder, it is not a critical 
component of such a sounding system. This finding is particularly significant towards considerations of the design of an 
advanced sounding system for flight on Geostationary satellites, on which it is more difficult to build a high spatial 
resolution microwave sounder. 
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AIRS/AMSU 


AIRS/AMSU are the advanced IR/MW sounders on EOS Aqua 

AIRS has 2360 channels between 650 cm 1 and 2665 cm 1 with v/Av ~ 1200 

The Goddard DISC has generated products using AIRS Version-5 retrievals starting 
from September 2002 

AIRS Version-6 has significant improvements compared to Version-5 

Version-6 is completed and will become operational at the DISC in late 2012 for 
future processing and reprocessing of all AIRS data 
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Overview of AIRS/AMSU Version-6 Retrieval Methodology 


We analyze AIRS/AMSU observations using a physically based retrieval system 
Independent of GCM except for surface pressure - used to compute expected radiances 
Uses cloud cleared radiances % to determine the solution 

Rj represents what AIRS would have seen in the absence of clouds 

/\ 

Derivation of R, is updated in different steps of the retrieval process 
Basic steps 

1) Neural-Network initial guess using AIRS/AMSU observations: X° 

2) Initial cloud clearing produces Rj°for all channels 

Uses X°and observed AIRS radiances 

/\ n 

3) Sequentially determine surface parameters, T(p), q(p), 0 3 (p), CO(p), CH 4 (p), using Rj 

4) Each step uses its own set of channels^ 0 

5) Derive cloud parameters and OLR using observed radiances R, 

Repeat steps 2) - 5) 

Generate error estimates and use for Quality Control (QC) 

Version-6 will be run both in AIRS/AMSU and AIRS Only (AO) modes 
Version-6 AO retrieval system is analogous but does not use AMSU observations 
Provides a back-up system should AMSU fail 
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Objectives of AIRS/AMSU 


Provide real time observations to improve numerical weather prediction 

Could be^ (used by NCEP, ECMWF) orT(p), q(p) 

Accuracy of T(p), q(p) degrades slowly with increasing cloud fraction 
There is a trade-off between accuracy and spatial coverage 

Using soundings or radiances only in clear cases limits utility of the data 

Provide observations to measure and explain interannual variability and trends 

Must provide good spatial coverage but also be unbiased 
Can be less accurate than needed for data assimilation 
Must not contain systematic data gaps in certain regions 

AIRS Version-5 and Version-6 contain accurate error estimates for T(p), q(p), and R, 

Error estimates and quality flags provide options for use in either weather or climate 
applications 
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Improvements in Version-6 Compared to Version-5 


As in Version-5, following theoretical considerations: 

Tropospheric 15 pm C0 2 channels are used only for cloud clearing 

Gives coefficients to generate clear column radiances R= for all channels 

/\ 1 

Rj in 4.3 pm C0 2 channels and stratospheric 15 pm C0 2 channels are used to 
determine temperature profile T(p) 

This allows for accurate temperature profile soundings under more difficult cloud 
conditions 

/\ 

• The major improvement over Version-5 is the exclusive use of Rj for shortwave 

window channels to determine simultaneously T s , shortwave spectral emissivity 
£sw(v), and spectral surface bidirectional reflectance e sw (v) 

Rj in longwave window channels are used in a subsequent retrieval step to determine 
£lw(v) given T s ^ 

Version-5 solved for surface skin temperature using Rj in both longwave and shortwave 
window channels simultaneously 

• Version-6 uses a Neural-Net initial guess while Version-5 used regression guess 

The Neural-Net initial guess degrades more slowly than regression with increasing 
cloud cover 

• Version-6 also has further improved the methodology for Quality Control of T(p), q(p) 

• Version-6 has improved the algorithms used to compute OLR, cloud products, and trace 
gasses - not discussed today 
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Methodology Used for Version-5 T(p) Quality Control 

Cases are rejected outright if cloud fraction > 90%. Otherwise: 

Define a profile dependent pressure, p best , above which the temperature profile is 
flagged as good - beneath p best profile is flagged as bad 

Use error estimate 8T(p) to determine p best based on comparison with error estimate 
thresholds AT(p) 

6T(p)<AT(p) down to P best 

Temperature profile statistics include errors of T(p) down to p = p best 

Version-5 used Standard AT(p) thresholds optimized simultaneously for weather and 
climate applications - middle ground between accuracy and spatial coverage 

Subsequent research has shown that use of more stringent (Tight) AT(p) thresholds to 
define p best performs better in Data Assimilation experiments in which T(p) is 
assimilated down to p best 

Tight thresholds gives higher accuracy retrievals with less spatial coverage 

Additional research also has shown that looser thresholds (greater spatial coverage) is 
better for climate applications 
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Methodology Used for T(p) Quality Control in Version-6 


Essentially no retrievals are rejected outright unless retrieval fails 

All successful retrievals are accepted down to 30 mb 

QC is otherwise analogous to Version-5 but uses tight thresholds AT A (p) for Data Assimilation 
and loose thresholds AT c (p) for Climate applications 

AT a QC thresholds define p best and AT C thresholds define p good 

QC = 0 down to p best , QC = 1 between p best and p good and QC = 2 below p good 

Data Assimilation experiments use T(p) down to p best 

Climate applications use T(p) down to p good 

Level-3 gridded products use cases with QC = 0 or 1 

AT a QC thresholds were set so as to give RMS errors ~1K for accepted cases 

AT C QC thresholds were set so as to maximize spatial coverage and achieve < 2K 
tropospheric RMS errors with essentially unbiased retrievals 

Results shown are based on retrievals on seven different days in different months and 
years, using ECMWF as "truth" 
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N umb e r 


Surface Skin Temperature Difference 
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7-Day Mean Emissivity minus Masuda 
50° North to 50° South Ocean 
950 cm 1 2400 cm 1 




Zenith Angle Zenith Angle 

Version-5 1:30 AM 

Version-5 1:30 PM 

Version-6 1:30 AM 

Version-6 1:30 PM 

Version-6 1:30 AM AIRS Only 

Version-6 1:30 PM AIRS Only 


Version-6 Ocean surface emissivities are much closer to Masuda model used as truth 
Version-5 Ocean surface emissivities are strongly impacted by sunglint during the day 
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7-Day Average Surface Skin Temperature (K) Non-Frozen Ocean 
Retrieved minus ECMWF AM/PM Average 



rr 


- 9.50 - 6.50 - 3.50 - 0.50 1.50 4.50 7.50 

GLOBAL MEAN= -0.31 STANDARD DEV= 0.57 % Grid Fill 


10.50 

99.35 




- 9.50 - 6.50 - 3.50 - 0.50 1.50 4.50 7.50 10.50 

GLOBAL MEAN= -0.35 STANDARD DEV= 0.73 % Grid Fill 92.24 


Version-6 7-day mean Level-3 SST product has much better spatial coverage 
and accuracy than does Version-5, which has gaps in cloudy areas. 
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7-Day Total Precipitable Water (cm) 
Retrieved minus ECMWF AM/PM Average 
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As with Ocean SST, Version-6 7-day mean Total Precipitable Water has much 
better accuracy and spatial coverage than Version-5 
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PRESSURE (mb) 


Global 7-Day Temperature Profile Statistics 


Percent of All Cases 
Accepted 


1 km Layer Mean RMS (°K) 1 km Layer Mean BIAS (°K) 

Differences from ECMWF Differences from ECMWF 




V5 Tight 

V5 Standard (QC=0) 

V6. DA OC (QC=0, PBsst) 

V 6 . Climate (QC=0,1, PGood) 

V 6 . AO DA QC (QC = 0, PBest) 

V6 . AO Climate (QC=0,1, PGood) 


V5 Tight 

VS Standard (QC=0) 

V 6 DA OC ( QC = 0 , PBest) 

V6 Cl Imate (QC=0, 1 , PGood) 

V6 AO DA OC (QC=0, PBest) 

V6 AO Climate (QC=0,1, PGood) 


V5 Tight 

V5 Standard ( 0C=0 ) 

V 6 DA OC (QC-0. PBest) 

V6 Climate (QC=0,1, PGood) 

V6 AO DA OC (QC=0, PBest) 

V6 AO Climate (QC=0,1, PGood) 


Version-6 T(p) retrievals with DA QC have RMS errors < IK throughout troposphere 

Version-6 T(p) retrievals with Climate QC have much greater yield than Verison-5 with small biases 

Differences between Version-6 and Version-6 AO are small 
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PRESSURE (mb) 


Global Temperature Layer Mean RMS (°K) Differences from ECMWF 

7-Day Two Common Ensembles 

Final Retrievals Version-5 Final and First Guess Version-6 Final and First Guess 



Version-6 Using Version-5 Tight QC V5 Tight FG V6 Using V5 Tight FG 

Version-5 Using Version-6 Climate QC V5 Using V6 Climate V6 Climate 


Version-6 Climate QC V5 Using V6 Climate FG ___ \/6 Climate FG 

Version-6 T(p) retrievals are more accurate than Version-5 on common ensembles, especially 
for the harder (cloudier) ensembles of cases 
Both sets of retrievals improve over their first guesses 

The Neural-Net first guess accuracy degrades much more slowly with increasing cloud cover 
than does regression 
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7-Day Mean Statistics Tropospheric Temperature Metric (TTM) 

and Boundary Layer Metric (BLM) 


Cases in Common Using the Version-5 Tight Ensemble 


Global 

TTM BLM 

Land ±50° 
TTM BLM 

Ocean ±50° 
TTM BLM 

Poleward of 
50°N 

Poleward of 50°S 
TTM BLM 




TTM BLM 


Version-5 

1.10 1.29 

1.19 1.71 

1.04 1.13 

1.14 1.50 

1.31 1.76 

Version-6.02 

0.92 1.16 

0.94 1.49 

0.86 0.98 

0.96 1.47 

1.20 1.69 


Cases in Common Using the Version-6 Climate Ensemble 



Global 

Land ±50° 

Ocean ±50° 

Poleward of 50°N 

Poleward of 


TTM 

BLM 

TTM 

BLM 

TTM 

BLM 

TTM 

BLM 

50°S 





TTM 

BLM 

Version-5 

1.67 

2.57 

1.82 

2.78 

1.65 

2.48 

1.53 

2.39 

1.72 

2.72 

Version-6 

1.11 

1.67 

1.06 

1.75 

1.03 

1.34 

1.12 

1.93 

1.32 

2.02 


TTM is the average T(p) RMS difference from ECMWF over all 1 km layers from surface to 100 mb 
BLM is the average T(p) RMS difference from ECMWF over the lowest six 0.25 km layers 
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PRESSURE (mb) 


Global Temperature Trends 7-Day 

Percent of All Cases (%/yr) Layer Mean BIAS (K/yr) 

Accepted Differences from ECMWF 



Vers I on-6 AIRS Only Climate QC Versfon-6 AIRS Only Climate QC 

Version-5 had a significant negative yield trend and negative tropospheric T(p) bias trends 
compared to ECMWF 

These are significantly reduced in Version-6 and Version-6 AO 
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Summary 


AIRS Science Team Version-6 algorithm determines tropospheric T(p) and T skin using only 
shortwave channels 2358 cm 1 - 2664 cm 4 . The 15 pm tropospheric sounding C0 2 
channels are used only for cloud clearing (as in Version-5.) 

The Version-6 algorithm also uses a Neural-Net start-up state in place of regression. The 
Neural-Net guess degrades much more slowly with increasing cloud cover. 

These modifications result in: 

• Improved soundings of T(p) and SST, day and night. 

Improvements are larger with increasing cloud cover. 

• Improved retrievals of surface spectral emissivity, day and night. 

• The ability to produce AIRS Only retrievals with comparable accuracy to 
AIRS/AMSU retrievals. 


Success of AIRS Only retrievals shows that a Geostationary AIRS-like IR sounder does not 
require a Geostationary MW sounder, provided it has low noise channels extending to 
2500 cm 1 . 
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